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Abstract 



In extreme value statistics the peaks-over-threshold method is widely 
used. The method is based on the Generalized Pareto distribution ([I], 
[13] in univariate theory and e.g. [4], |15| in multivariate theory) charac- 
terizing probabilities of exceedances over high thresholds. We present a 
generalization of this concept in the space of continuous functions. We call 
this the Generalized Pareto process. Differently from earlier papers, our 
definition is not based on a distribution function but on functional prop- 

» "*! ] erties, and does not need any specification with the related max-stable 

C^ ■ process. 

As an application we use the theory to simulate wind fields connected 
to disastrous storms on the basis of observed extreme but not disastrous 
storms. 

Keywords: domain of attraction, extreme value theory, generalized Pareto 
process, max-stable processes, functional regular variation 

1 Introduction 

We say that a stochastic process X in C(S) (the space of continuous real func- 
tions on S, with S a compact subset of M. d ) is in the domain of attraction of a 



max-stable process if, there are continuous functions a n (s) positive and b n (s) 
on S such that the processes 



max r ^— - — 

i<i<n a n (s) 



s£S 



with X, X\ , . . . , X n independent and identically distributed, converge in distri- 
bution to a max-stable process Y in C(S). Necessary and sufficient conditions 
for this to happen are: uniform convergence of the marginal distributions and 
a convergence of measures (in fact a form of regular variation) : 

lim tP (T t X 6 A) = v(A) (1.1) 

t—¥oo 

where T t X(s) := (l + j{s) X{s ^ {s) ) for all s e S (with the notation 

x + = max(0,x) for any real x); v is a homogeneous (of order -1) measure on 
C+{S) := {/ e C{S) : f > 0} and A any Borel subset of C+{S) verifying 
v{dA) = and inf{sup se5 f(s) : f £ A} > (de Haan and Lin (2001), cf. de 
Haan and Ferreira (2006) Section 9.5). The functions a t (s) and b t (s) are chosen 
in such a way that the marginal distributions are in standard form: 

cxp {- (1 + 7 (s)xr 1/7(;i) } , 1 + j(s)x > 0, 

for all x G M and s € S. Here 7 is a continuous function. In particular one may 
take bt(s) := inf{x : P(X(s) < x) > 1 — 1/i}. This is how we choose bt(s) from 
now on. One possible choice of a t (s) is a t (s) := 7(s)(&2t(s) — fr*(s))/(2 7 ( s ' — 1). 
From (fL~Tj) it follows that 



P ((l+7(0 



x(-)-M-) 
»t(0 



1/7(0 



p ( o 1in -y(s)-ht(a) ^ n 

^l su P s es Q ,( S ) >u 




a t (s) 

converges as t — > 00 and so does 

, ( . l M ,tt) EJ |,*f >D ). 

at{-) /+ ses at{ s ) J 

The limit constitutes a probability distribution on C + (S). 

This reasoning is quite similar to how one gets the generalized Pareto dis- 
tributions in R (Balkema and de Haan, 1974) and in R d (Rootzen and Tajvidi, 
2006; Falk, Hiisler and Reiss, 2010). It leads to what we call generalized Pareto 
processes. 

The paper is organized as follows: The Pareto processes will be dealt with 
in Section [2] As in the finite dimensional context it is convenient to study first 
generalized Pareto processes in a standardized form. This is done in Section [2~D 
The general process is discussed in Section l2~2l In the meantime, in Section l2.1.1l 



is the discrete version of our approach to simple Pareto processes, leading to 
multivariate simple Pareto random vectors. Domain of attraction is discussed 
in Section I3J 

In Section 21 we show that by using the stability property of generalized 
Pareto processes one can create extreme storm fields starting form independent 
and identically observations of storm fields. 

In the following, operations like w± + w-i or w\ A w-i with wi, w% 6 C(S) mean 
respectively {u>i(s) + W2{s)} S £S and {wi(s)Aw2(s)} s eS- Then with abuse of no- 
tation, operations like w+x or wAx with w <E C(S) and i£R mean respectively 
{w(s) + x} sl zs &ud {w(s) A x} sS 5. Similarly for products and powers. Then 

e.g. we shall simply write (l + l^A * ^ for j (l + l(s) X(s ^f s) ) * '' 1 , 

with X = {X(s)} s( zs, a t = {a t (s)} seS , b t = {b t (s)} s< z S and 7 = {7(s)} se s- In 
general, C(S) is the space of continuous real functions on S C K rf equipped with 
the supremum norm. 



2 Pareto processes 

2.1 The simple Pareto process 

Let C + (S) be the space of non- negative real continuous functions on S, with S 
some compact subset of M. d . We denote the Borel subsets of a metric space by 
B(-). 

Theorem 2.1. Let W be a stochastic process in C + (S) and uiq a positive con- 
stant. The following three statements are equivalent: 

1. (POT - peaks-over-threshold stability) 

(a) E {W{s)l sup seS W{s)) > for all seS, 

(b) P (sup sGS W(s)/u!o > x) — x~ l , for x > 1 (standard Pareto distri- 
bution), 

(c) 

P( UoW w( ^ B\sn P W(s)>r)=p( ^.. 6 B 

Vsup seS W(s) ' seS ) Vsup sGS I^(s) 

(2.1) 
for all r > ojo and B € B \C£ (S)) with 

<5+(5) := {/ e C + (S) : sup/( S ) = c }. (2.2) 

s£S 

2. (Random functions) 

(a) P (sup ses W{s) >u ) = l, 

(b) E {W(s)/sup seS W(s)) > for all s E S, 



(c) 

P{W erA)=r- 1 P(W e A), (2.3) 

for all r > 1 and A G B (C+ (5 1 )), where rA means the set {rf, f G 
j4} ; and 

C+ (5) := {/ 6 C+(5) : rap /(a) > c }. (2.4) 

s<ES 

3. (Constructive approach) W(s) = YV(s), for all s G S, for some Y and 
V = {V(s)} seS verifying: 

(a) V G C + (S) is a stochastic process verifying sup seS V(s) — luq a.s., 
and EV(s) > for all s G S, 

(b) Y is a standard Pareto random variable, Fy{y) = 1 — 1/j/, y > 1, 
(cj y and V^ are independent. 



Definition 2.1. TTie process W characterized in Theorem \2.1l with threshold 
parameter ujq , is called simple Pareto process. The probability measure in (|2.ip 
i.e., 

P{B) = P( ^.. efl), /orBeB(Ci(S)), (2.5) 

is called the spectral measure. 

Some easy consequences of Theorem 12.1 1 3. are the following. The process 
W is stationary if and only if V is stationary. Independence at any two points 
si,s 2 G S, i.e. W(si) and W{s2) being independent, is not possible. Complete 
dependence is equivalent to V = uq a.s. We shall come back to some of these 
issues. 

Proof of Theorem 1 2. 1\ We start by proving that 1. implies 3. By compactness 
and continuity, sup seS W(s) < oo a.s. Take: 

y ^su Psg5 ^( S ) ^ y _ u W 



B- 



ujq sup seS W{s) 

Then (a), (b) and (c) are straightforward. 
Next we prove that 3. implies 2. Let 

A r .B = (/ e C+(S) : sup/(s)/u;o > r, ^TT e S l = r x A i 

{ ses sup ses f{s) J 

for all r > 1 and BeB (C+ {S)). Then, 

P (W e A r>B ) = P ( sup W(s)/u> > r, ""^ G B) 

\ses sup seS W(s) J 

= P(Y >r,VeB) = P(Y > r) P(V G B) 

1 / OJnW \ 1 

= -P sup W(a)/wo > 1, °—— - £B)=-P(We A hB ) 

r \ses sup seS W(s) ) r 



using in particular the independence of Y and V and P (sup seS W{s)/u)q > 1) = 
1. Since P(rA) = r~ 1 P{A) holds for any of the above sets, it holds for all Borcl 
sets in the statement. 

Finally, check that 2. implies 1. For any r > 1, by (c) and (a), 

p ( su PseS W(s) >r \l p ( sn PseS W(s) >l ) = l 
Also for any BeB (<7+ (5)), 

- ip (»,»(* > 1, »° W WI , «= B ) - if ( "°^, M <= B 

r Vsgs sup seS Ty(s) ) r \sm Ps ^ s W(s) 

since sup sGiS VF(s) > wo holds a.s. That is, it follows that sup seS W(s)/uj is 
univariate Pareto distributed and, sup sgS W(s) and W/sup se g W(s) are inde- 
pendent. □ 

The following properties are direct consequences: 

Corollary 2.1. For any simple Pareto process W, the random variable 
Ll)q sup sg 5 W(s) has standard Pareto distribution. 

Corollary 2.2. W G C + (S) is a simple Pareto process if and only if any of the 
two equivalent statements hold: 

1. (a) E {W{s)/ sup seS W{s)) > for all seS, 
(b) P (sup seS W(s)/u >x) = x-\ for x > I, 

(c) 

P[W erA\supW{s) >™ ) =P(We A) (2.6) 

V ses J 

for allr>landAeB (C+ (S)) . 

2. (a) E (W(s)/ sup seS W(s)) > for all seS, 
(b) 

pUm >r ,^w €B ) = m, (2 . 7) 

V sG s w sup seS W(s) ) r 

for all r>\ and B € B (<7+ (S)) . 



From (|2.6j) we see that the probability distribution of W serves in fact as 
the exponent measure in max-stable processes (cf. de Haan and Ferreira (2006), 
Section 9.3). Characterization 2. suggests ways for testing and modeling Pareto 
processes. 

We proceed to express the distribution function of W in terms of the prob- 
ability distribution of V from Theorem 12.11 3. and Definition 12.11 



Let w,W G C+(5). Conditions W < (>)w define the sets {/ G C+(5) : 
/(s) < (>)w(s) for all s G S} and (W jC w) defines the set {/ G C*+(S") : /(s) > 
w(s) for at least one s G S 1 }. Hence note that T¥ > u> is not the complement of 

Take for the conditional expectation, 

E(g(V)\V EB)= *J g (y)dp(v), BeB (C+(S)) , 

defined in the usual sense and whenever p(B) — P(V G B) > 0, with g a real 
functional (e.g. see Billingsley, 1995). 

Proposition 2.1. Let w,W G C + (S), iot£/i VF simple Pareto process. Let 
So = {s € S : w(s) — 0}, So = S\/So ^ e complement of So, and Bq = {/ G 
(7+ (5) : inf sego ^g > 1 and f(s) = for s G 5 }. Then 

P(W<w) = { P( B o){ 1 - E { su PseS ^\V€B )} l fp(Bo)>Q 
\ ifp(B )=0. 

(2.8) 

Proof. 

P(W <w)= P(W(s) < w(s) for s G S and W(s) < w(s) for s G S ) 

= P\Y < inf ^44 and V(s) = for s G S 

V ses V{s) 

( IjUi Si 1V[ Si 

= P [Y < inf — V4 and V(«) = for s G S and inf — ^4 > 1 

V seSo K(s) se so V(s) 

+P y < inf — y- and V(s) = for s G S and inf — H < 1 
V ses y(s) sS 5 ]/(s) 

= / P ( Y < inf ^) *,(„) 

Jb„ V *e.So f (s) J 

= [ i - sup 44<w = p(^o) - / sup 44^) 

iSo sGSo W ( s ) J Bo se5 w ( s ) 

= p (Bo)\l-E[sup^\VEBo)\ 
{ \ses w(s) J J 

where the last but two equality follows by the fact that the second summand in 
the previous equality is zero and, from the independence of Y and V. □ 



Corollary 2.3. Under the conditions of Provosition \2A\ 

P(W < w) = 1 - E I sup ^4 ] tf P( B o) = l - ( 2 - 9 ) 

\ses Ms) ) 

The following is obtained in the particular case of w being strictly positive: 



Proposition 2.2. Let w, W £ C + (S), with w positive and W simple Pareto 
process. Then, 

P(W <w) = E ( sup V }'^ )-E ( sup -^4^) . (2.10) 

\ seS ra(s)Awo/ Vsesw(s)/ 

Proof, (i) First consider the case inf s£ sw(s) > wo. Use Theorem l2.ll part ■?., 

P(W < w) = P(YV <w)=p(y< inf ^4^1 = 1 - £ fsup ^4^1 (2.11) 

V sesV(s)J \ seS w{s)J 

hence, 

P(W ^ w) =E (sup-^4 ) ■ ( 2 - 12 ) 

Ws w(s) J 

(ii) The probability measure of W on C ( j (S r ) can be extended to a measure 
v on C + (S) while keeping the homogeneity relation (|2.3|) as follows: for any 
Borel set B such that 

sup sup/(s) < loq and < e < inf sup/(s), 
feBseS f£ B ses 

we define 

This measure (the same as in (jTTTJ)) is homogeneous of order -1: 

v{rB) = r~ x v(B) for all r > and B E B (C + (S)) . 

Then, the probability distribution of W is the restriction of v to C+ (S) i.e., for 

BeB(C+(S)), 



P{W eB) = pif£B, sup /(a) > wo ^ . (2.13) 

Hence, by the homogeneity property of v, (J2.13I) and (12.12)) in that order: 

i' {/ S? ^} = — — z/ \f t- 



inf se g iy(s) [ inf se sw(s) 

" Pf^ ./"" Wfsup^). (2.14, 



inf seS w(s) V inf se s w(s) ) \ seS w(s) 

By (|2.13[) . elementary set- measure operations and (|2.14[) in that order: 

P(W£w) = v{f-£wJ<£uo} 

= v{f i w} + v{f £ uj Q }-v{f ^ w or / ^ w } 

= pfsu P ^) + l-p(sup^_ 
V se s w(s) / V seS w(s) A wo 



D 



Note that E (sup sGi § W ^ W0 \V G B j = 1, whenever p(B ) > 0, which 
links the results of Propositions 12.11 and 12.21 
The following formulas might also be useful: 



Corollary 2.4. Let w, W G C + (S), with W simple Pareto process. Then: 

w{s) 

7R 



a) With B x = {f€ C+ (S) : sup seS ## > 1 and mi seS f(s) > 0}, 



P(W> W ) = l P(Bi)E{inUes^\Ve Bl ) if p( Bl ) > 
\ ifp(B 1 ) = 0. 

In particular, if P(V > 0) > and sup seS w(s) > uj , 

P(W> w) = P(V>0)E[ inf —H-\V> ) . (2.16) 

\sGS W(S) ' J 

b) If w > and sup seS w(s) > u> , 

P(W >w)=E [inf ^4 I . (2.17) 

Vses w(s) J 

c) If E (inf se5 V{s)) > 0, for xeR, 

P{W > x\W > uj ) = { *' , X - W ° (2.18) 

[ uo/x, x > Wo- 

dj // £ (inf seS F(s)) > 0, for x G R and for each s G S, 

P(W(s)>x\W(s)> Uo ) = { 1 J o/Xi 112 (2.19) 

Proof. For (|2.15[) . similarly to the proof of Proposition 12. 11 

P(W >w)=P[Y> sup ^44 and inf V(s) > 
V ses V(s) ses 

= f iniJ^\dp(v)=p(B 1 )E(mi^f\\VeB 1 

J Bl seS W{S) \s£S w(s) ' 

For ([2~T7l) . 

P(W >w) = P(YV >w)=P\Y > sup —^4- ) =E[ia£ —H- 

using Y standard Pareto and independent of V. 
For c) note that 

P (W > w ) = P [Y inf V(s) > uj ) = Emm ( 1, lr " 



ses J \ cjo 

— E inf V(s) > 0. 

wq s es 



n»)\ -,, 



Then ([2~T51) follows from (|2T7|I . 
For dj note that, if x > cjo, 

P(W(s) > ») = P(ry(s) > x) = Pmin I 1, -^1 I = a; _i EF(s) > 0. (2.20) 

D 

Relation (I2.19[) indicates that one-dimensional marginals, conditional on 
the process being larger than u)q, behave like Pareto; similar observation is in 
Rootzen and Tajvidi (2006) in the context of lower-dimensional distributions. 

Let si, S2 £ S and x > luq. From (|2.20[) . 

P{W{ Sl )>x) = Eiy{Sl)) >0, f = 1,2 

x 

and, similarly 

P mSl )>x,W( S2) >x) = E ^ AV ^\ 

X 

Hence P (W(s x ) > c, VF(s 2 ) > c) = P (W(ai) > c) P (W(s 2 ) > c) for all c> w 
is equivalent to verify E (V(s x ) A V(s 2 )) = c _1 JS (V(si))E (V(s 2 )) for all c > 
wo, which is impossible. That is, independence in the Pareto process between 
any two points is impossible. 

For later use we define next max-stable processes and give well known prop- 
erties useful for our purposes. 

Definition 2.2. A process r\ — {f7(s)}seiR € C(M) with non-degenerate margi- 
nals is called max-stable if, forr\\, 772, ... , independent and identically distributed 
(i.i.d.) copies ofn, there are real continuous functions c n = {c„(s)} se R > and 
d n = {d n (s)}sem such that, 

Vi ~ d n d n ,, 1 „ 

max = ?y for all n = 1, 2, . . . . 

l<i<n C n 

It is called simple if its marginal distributions are standard Frechet, and then it 
will be denoted by fj. 

Proposition 2.3 (Penrose(1992), Theorem 5.). All simple max-stable processes 
can be generated in the following way. Consider a Poisson point process on 
(0,oo] with mean measure r~ 2 dr. Let {Zi}°l 1 be a realization of this point 
process. Further consider i.i.d. stochastic processes Vi,V2, ■ ■ ■ in C + (R) with 
EV\{s) = 1 for all s £ R and -Esup sGR V(s) < 00. Then 

fj= d max Zi V,. (2.21) 

i=l,2,... 

Conversely each process with this representation is simple max-stable (and one 
can take V such that sup sGR V(s) — c a.s. with c > 0). 



The finite dimensional distributions of fj where computed in de Haan (1984). 
With the above representation for simple max-stable processes, for s\, . . . , s n £ 
R, jbi, . . . ,x n > 0, for all n £ N, 

G(xi, . . . , x n ) — P(f}(si) < xi, . . . , fj(s n ) < x n ) — exp ( -E max 



l<i<n Xi 

(2.22) 
Note that fj is the maximum of infinitely many processes whereas W depends 
on just one of those processes (Theorem 12.11 3.). 

2.1.1 The finite-dimensional setting 

Simple Pareto random vectors can be constructed similarly as in Theorem 12.11 
- Definition EU 



Theorem 2.2. Let (W\, . . . , Wd) be a random vector in W± = [0, oo) d and wo 
a positive constant. The following three statements are equivalent: 

1. (a) E(Wi/msxi=i 1 ... t d Wi) > for all i = 1, . . . ,d, 

(b) P (maxi = i ! ... ! d Wi/u>Q > x) = x^ 1 , for x > 1 (standard Pareto distri- 
bution), 

(c) 

l ' ij (W 1 ,...,W d ) , ... \ ( wo(Wu...,W d ) c _ 

P I 777- € B max Wi > ruj ) = P [ — - £ B 

maxi=i d Wi i=i,...,d / \ maxi=i d VVi 



/or all r > 1 and B £ B (D+) with 



(2.23) 



££o : = K w i' ■ • ■ . «>«0 6 R+ : . max m a = uj }. (2.24) 



g. ( a J P (max i= i ) ... )d W > w ) = 1, 

(7»; £ (Wi/ max i= i,...,d Wi) > for alii = l,...,d, 

(c) 

P((W U . ..,W d )£ rA) = r- x P{{W u . ..,W d )£ A), (2.25) 

for all r > 1 and A £ B (-D+J wift 

£>+ :={(wi,...,Wd)eR+: max w % > u }. (2.26) 

i— l,...,a 

5. (Wi., . . . , W 7 ^) = Y(Vx, . . . , Vd), /or some Y and random vector (V\, . . . , Vd) £ 
R^ verifying: 

(a) maxi = i d V% = ^o a.s., and EVi > /or all i — 1, . . . ,d, 

(b) Y is a standard Pareto random variable, Fy{y) = 1 — 1/y, 2/ > 1, 
fcj Y and V are independent. 



10 



Proof. Similar to the proof of Theorem 12.11 □ 

Definition 2.3. The random vector (Wi, . . . , Wd) £ K+ characterized in Theo- 
rem \2.1[ with threshold parameter u>q , is simple Pareto. The probability measure 
m (|2~23f i.e., 

K^ff ^' ^ A for B eB(D+ o ), (2.27) 

is called the spectral measure. 

From the finite dimensional representation it follows again that for having 
all marginals Pareto, one would have to have max(V(si), . . . , V(s n )) = u)q, for 
all si, . . . , s n € S and all integer n, which corresponds to V = loq a.s., i.e. the 
complete dependence case. 

Nonetheless, we see that it is still possible that some finite dimensional 
marginals of a Pareto process are still Pareto. For example, consider a situation 
where the maximum of the process may only occur at some fixed locations in 5* 
and the related finite dimensional random vector includes these locations. 

One can give formulas for distribution functions, following similar reasoning 
as before. The correspondent to Proposition 12 . 1 1 is given by, 

P(W l < Wu ...,Wd<w d ) = { P(Bo){l-E(^ Ia %\B )} ii P (B )>0 

\ ifp(£ ) = 0, 

(2.28) 
where Iq = {1 < i < d : Wi = 0}, Iq = {1 < i < d : Wi ^ 0} and Bq = 
{(Vi, . . . , V d ) : Vi = for i G I and min ie/o ^ > 1}. 

The correspondent to Proposition 12. 2[ with Wi > for all i = l,...,d, is 
given by, 

P(Wi <w 1 ,...,Wd<Wd) = E[ max — ) -E[ max — ) , (2.29) 

\l<i<d Wi A WO J \l<i<dWiJ 

which corresponds to the one given in Rootzen and Tajvidi, in the correspondent 
region. 

Note that Rootzen and Tajvidi's formula (i.e. (2) in Definition 2.1) only 
holds for the vector x larger than the vector of the lower endpoints of the 
marginal distributions. The following example illustrates this. Consider the 
model related to two independent unit Frechet. Then Rootzen and Tajvidi's 
formula corresponds to 

H(x,y) = [ 5 G^o - ^o " S " \) . (^ > and y > 1) or (y > and x > 1) 
1 0, < x < 1,0 < y < 1 

but, it is undetermined otherwise. Then note that their formula is not able to 
capture the positive mass on the axis. 



11 



By direct calculations, or alternatively applying (J2.28I) with the bivariate 
Pareto random vector (YB,Y(1 ~ B)), with B Bernoulli (1/2), the correct 
distribution function is, 



P(YB < x,Y(l-B) < y) = < 



, 1(1-1) ifx>l,0<y<l 



\\}-k) ify>l,0<a:<l 

otherwise . 



(2.30) 



Another remark on Rootzen and Tajvidi (2006): their Theorem 2.2(h) is not 
completely correct. It is not sufficient to require condition (6) of the same paper 
for x > 0. A counter example is given by 

/I 1 \ 1/2 

P(X > x or Y > y) = I V 2 ^ + V 2 ^ J , x V y > 0, 

and zero elsewhere. This distribution satisfies (6) for x > but not for all x 
and it is not a generalized Pareto distribution. 

2.2 The generalized Pareto process 

Let C(S) be the space of real continuous functions on S with S C R d compact. 
The more general processes with continuous extreme value index function 7 = 
{7(s)} s gs, location and scale functions /i = {/i(s)} s es and a = {a(s)} se s is 
defined as: 

Definition 2.4. Let W be a simple Pareto process, /j,, (7,7 € C(S) with a > 0. 
The generalized Pareto process Wn >a ,^ £ C(S) is defined by, 

W - 1 
W IM , tx „=n + o- (2.31) 

7 

with all operations taken componentwise (recall the convention explained in the 
end of Section Qp. 



The result corresponding to Corollary |2.1l is. 



Corollary 2.5. The random variable sup s£ 5 < I 1 + 7(5) 



l ^*^) 1/7 \" 1 



/ias standard Pareto distribution. 

Related with stability or homogeneity properties we have: 
Proposition 2.4. For any generalized Pareto process W^ tCyn , 

pi'ii +7 ^)%X^f(i +7 ^) 1/1 e A 

(2.32) 
12 



for all r > 1 and A £ B [C£ (5 1 )) . Moreover, there exist normalizing functions 
u(r) and s(r) such that 

= p[(l + 7^^) 1/ %A (2.33) 

for allr>landAeB (C*+(S*)) . 

Conversely, ([03} and sup sgg | ( 1 + 7(5) H,ftM | ) ) ^ M ) " 1 w^ 1 &em# 

standard Pareto distributed, imply (|2.32p . 

Proo/. Relation (HO!?)) is direct from Definition |2"H and (ET5)) . Then, with u(r) = 
/i + cr(rT - l)/ 7 and s(r) = ar< and, for all r > 1 and A e B (C+ (S)), 



/M('-7^^) V7 ^i) ,, Vi . A v 



/> ((1+7 i^^) 1/7 ^ rcJ0 



p M +7 "^°.7 ^ g^ 



by (|232j) and Corollary [231 

Conversely, for all r > 1 and 4e6 (C+ (S)), 



''((l + 7^^f^ 



(su PseS (l + 7^T^) 1/7 -o- 1 >l) 



^((1+7^^) -1) /, „. _.,,. 



P|(l + 7- o 



by ([2331 and sup sSg I (l + 7(5) ff ""; ( ( ' ) ) " >l(8) ) '' 1 w^ 1 being standard Pareto 
distributed. □ 

The result corresponding to Proposition ^. 2l on distribution functions is now, 
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for w > 0: 



| W ( S )-M s)y /7(a) 

\ v cr(s) 



i^WW <«;) = £* sup V(«) I l + 7(g) ,C - 1 Aw 

-VtW 



4u P n S )(i + 7(,)*^ 



forl+ 7 (w-/i)/aeC + (S). 



3 Domain of attraction 

The maximum domain of attraction of extreme value distributions in infinite- 
dimensional space has been characterized in de Haan and Lin (2001). This result 
leads directly to a characterization of the domain of attraction of a generalized 
Pareto process. 

Let C(S) be the space of real continuous functions in S, with S C M d 
some compact subset, equipped with the supremum norm. The convergences 
below — > d denote weak convergence or convergence in distribution. Denote by 
fj = {f)(s)} se s an y simple max-stable process in C + (S) (cf. Definition ^. 21) . Any 
max-stable process r] = {?/( s )}ses m C(S) can be represented by rj = (fj 1 — l)/"f, 
for some fj and continuous function 7 = {7(s)} s eS- F° r simplicity we always 
take here 

Cf(S) = {f €C + (S): sup /(«)> 1}, 

ses 

i.e., w.l.g. consider the constant wo introduced in Section [2] equal to 1. 

The maximum domain of attraction condition in C(S) can be stated as: 

Condition 3.1. For X, X\, X2, ■ ■ ■ i.i.d. random elements ofC(S), there exists 
a max-stable stochastic process rj G C(S) with continuous index function 7, and 
a s (n) > and b s (n) in C(S) such that 



Xi{s)-b s {n) 

max -— ^— L 

i<i<n a s (n) 



> ses 



on C(S). The normalizing functions are w.l.g. chosen in such a way that 
- log P(r/(s) <x) = (1 + -f(s)x)- 1 / 1 ^ for all x with 1 + j(s)x > 0, s £ S. 

Next is an equivalent characterization of the domain of attraction condition, 
in terms of 'exceedances'. 

For X a random elements of C(S), suppose the marginal distribution func- 
tions F s (x) — P(X(s) < x) are continuous in x, for all s e S. To simplify 
notation write the normalized process, 

T t X=(l + 1 ^-^\ 1 '\ (3.2) 
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for 7 = {7(s)} s6 5 and normalizing functions a t — {at(s)} se s > and b t — 
{b t (s)} seS , all in C(S). 

Condition 3.2. For X £ C(S) suppose, for some at > and b t in C(S), 

lim tP ( X ^ 7 f t(S) >x]= (1 +7(g)^)~ 1/7(s) i l + 7(s)^>0, (3.3) 
t^oo y a t (s) J 

uniformly in s, 

P (sup seS T t X(s) > x) 1 
lim — t — ^-^2 LJ i = _ fo r a // x > i (3.4) 

t^oo P( SU p sGS T t X(s) > 1) .t' J ' v y 

lim P f ^ , 6 B| supT t X( S ) > l) = p(P), (3.5) 

*->oo Vsup sGS r t A(s) ses y 

/or eac/i B £ B (C^ (5 1 )) wift p(dB) = 0, mi/i p some probability measure on 
Ct(S). 

Note that this is the same as for max-stable processes; cf. Theorem 9.5.1 in 
de Haan and Ferreira (2006). Note also that (|3.4p - (|3.5l) specify a simple Pareto 
process in the limit. 



Theorem 3.1. Conditions \3.1\ and \3.2\ are equivalent. Moreover, the limiting 
probability measure in (|3.5j) is the probability measure of V in representation 
(l2~2Tll . 

Proof. Cf. Theorem 9.5.1 in de Haan and Ferreira (2006). The normalization 

there is 

1 



t(l-F s (X(s))) JseS 
instead of (13.21) but the results are the same. □ 



The following is a direct consequence: 
Corollary 3.1. Relations (I3.4[) - (13.51) imply 



lim P T t X £ A\ supTtX(s) > 1 = P(W S A), 

*^°° \ s£S J 

with A £ B (Ci (<!?)), P(<9A) = and W some simple Pareto process. 
The converse statement of Corollary 13. II is as follows: 



Theorem 3.2. Suppose that there exists a function b u — {& u (s)} se s, that is 
continuous in s for each u and increasing in u, and with the property that 
P(X(s) > b u (s) for some s £ S) — > as u — > oo, and a continuous function (in 
s), a u = {a u (s)} se s > such that, for some probability measure P on B{C{S)), 

lim P ( X ~ - £ A\X(s) - b u {s) > for some s £ S ) = P(A), 
u->oo \ a,. 



for allA£B (C(S)) and P(dA) = 0. Then Conditions\3j\ and\3M are fulfilled. 
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Proof. By the conditions on b u , we can determine q = q(t) such that P(X(s) > 
b q (t)(s) for some s € S) = l/t. Then with b t (s) — b q ( t )(s) and a t (s) — a q ( t )(s), 

lim tP ( — — G C and X(s) > b t (s) for some s G S ) = P(C), 
t-s-oo \ a t J 

for all C G B (C(5)) and P(<9C) = 0. In particular, if inf{sup seS f{s) : f G 
C} > we have 

lim tp(Z— ^ e c) =P(C). (3.6) 

t— >oo \ at / 

We proceed as usual in extreme value theory. Fix for the moment s £ S. It 
follows that for x > 

lim tP (X(s) > 6 t (s) + xat(s)) - P{f : /(*) > x}. 

Let C7 S be the inverse function of l/P(X(s) > x) and V(s) be the inverse 
function of 1/P {/ : /(s) > a;}. Then 

,. f/te(s) -&t(s) 

hm — = V x (s), for x > 0. 

t^oo at(s) 

It follows (Lemma 10.4.2, p. 340, in de Haan and Ferreira (2006)) that for some 
real 7(s) and all x > 

lim M«)-W = ^ ( ' ) -l and lim M£) =x7(s) . (3J) 

t-s-oc at(s) 7v s ) t-*oo a t (s) 

Since the limit process has continuous paths, the function 7 must be continuous 
on S. 

Now replace t in (|3.6p by ct where c > 0. Then 

lim tP ( btis) ~ b r {3) + ^^ec)= 1P(C) 
t->oo y atc(s) atc(s) a t J c 



X - b {\ lh ^<^^h\ 1- 



hence, by (J3.7I 



lim iP 1+7 ec(l+ 7 C) 1/7 =-P(C) 



and by (|3.6 



I_ ^"W1 j.^/nrNl/7 



l/ 7 

HmiPl I 1 + 7- — -J e(l + 7C*) i/7 J =P(G). 

Write P(4) = P ((^t - 1) /i). Then 

lim tP(T t X G A) =P(vl), 

with P(cA) = cr^A), for all c> and A G B (C(S)) such that inf{sup seS f(s) 
f £ A} > 1 and P(9A) = 0. The rest is like the proof of the equivalence between 
(2b) and (2c) of Theorem 9.5.1 in de Haan and Ferreira (2006). □ 
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Example 3.1. Any max-stable process verifies Condition \3.'A with p given by 
the probability measure of V from (|2.2ip . 

Example 3.2. Any Pareto process with spectral measure p is in the domain of 
attraction of a max-stable process where the underlying process V (cf. represen- 
tation (|2.2ip ) has probability measure p. 

Example 3.3. The finite dimensional distributions of the moving maximum 
processes obtained in de Haan and Pereira (2006) can be applied to obtain the 
correspondent finite dimensional distributions of the correspondent Pareto pro- 
cess, in the appropriate region. 

Example 3.4 (Regular variation (de Haan and Lin 2001, Hult and Lindskog 
2005) ). A stochastic process X in C(S) is regularly varying if and only if there 
exists an a > and a probability measure p such that, 

P(su PseS X(s) > to, JT/Bup seS X(s) £ ■) ^ * >„,*-> oo, 

P (su PsGS X{s) > t) 

(3.8) 
on {/ £ C(S) : sup seS f(s) = 1}. Hence, a regularly varying process such that 
(|3.3p holds for the marginals, verifies Condition \3.2\ with 7 = 1/a, b t — t and 
a t = t/a; note that the index function is constant in this case. 

On the other hand, the normalized process tT t X with T t X verifying (|3.4p ~ 
(|3.5p . verifies regular variation with a — 1 and spectral measure p on Ci(S). 

Remark 3.1. As seen in Section \2.1.1[ our analysis is also valid in the finite- 
dimensional set-up. The main difference from Rootzen and Tajvidi (2006) is 
that their analysis is entirely based on distribution functions whereas ours is 
more structural. Here are some remarks on their domain of attraction results. 

Let F = 1—F with F some d—variate distribution function, x ~ (x\, X2, ■ ■ ■ , Xd) £ 
M. d , and u(-) = (ui(-), v>2{), ■ ■ ■ , Ud(')) o,nd a the normalizing functions consid- 
ered in Rootzen and Tajvidi (2006) (see e.g. their definition o/X Uy ). By using 
a(xt)/a(t) -)■ (a; 71 , a; 72 , . . . , x ld ) and 
(u(xt)-u(t))/a(t) -> (22^ i) 22^ 2 ,...,2l±^J, t -4 oo, for some reals 

7i j 72 , ■ • ■ , Id (cf. proof of Theorem 2. 1 (ii) in Rootzen and Tajvidi (2006)) 
by 

F*(x) := F (u 1 (xi),u 2 (x 2 ), ■ ■ ■ ,u d (x d )) , 

one simplifies their relation (19) to 
tF*(tx) -+- log G[^- 



71 72 Id 

and one simplifies their relation (6) to 

P (X* < ix|X* £tl)=P (X* < x) 



for t > 1. Hence one can take u(i) := f , 

of that paper. 



£^i . . . , ^-^i ) in Theorem 2.2 
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4 View towards application and simulation 

4.1 Towards application 

The following remark may be useful for application. Suppose the domain of 
attraction condition (jl.ip holds. Define B = {/ e C + (S) : / > 1}. Let A be a 
Borel set in C + (S). Then applying (|1.1|) twice we get 

lim P (T t X e A\T t X € B) = 



This is the content of Corollary 13.11 It gives a limit probability distribution on 
B. 

A similar reasoning holds with B replaced by a different set B' as long as 
inf{sup sGS f(s) : f G S'} > 0. Consider in particular B' = {/ G C + (5) : 
r t /(»i)>lfori=l,...,p}. Then 

lim P (T t * 6 A|T t X G B>) = V{A P U B ' ] (4.1) 

i^oc ' V\B ') 

which again is a probability distribution. 

Now we proceed as in the peaks-over-thrcshold method for scalar obser- 
vations: let k = k(n) be a sequence of integers with lim„_ i . 00 k(n) — oo and 
lim n ^.ook(n)/n = 0, as n — > oo. Suppose that we have n independent observa- 
tions of the process X in the domain of attraction. Select those observations 
satisfying X(si) > b n /k{si), for i = 1,2,..., p. The probability distribution 
of those selected observations is approximately the right-hand side of (|4.1[) . 
This seems a useful applicable form of the peaks-over-threshold method in this 
framework as it suggests estimating the spectral measure using observations 
that exceed a threshold at some discrete points in the space only. 

4.2 Towards simulation 

'Dcltares' is an advisory organization of the Dutch government concerning (among 
others) the safety of the coastal defenses against severe wind storms. One stud- 
ies the impact of severe storms on the coast, storms that are so severe that 
they have never been observed. In order to see how these storms look like it 
is planned to simulate wind fields on and around the North Sea using certain 
climate models. These climate models simulate independent and identically dis- 
tributed (i.i.d.) wind fields similar to the ones that could be observed (but that 
are only partially observed). Since the model runs during a limited time, some 
of the wind fields will be connected with storms of a certain severity but we 
do not expect to see really disastrous storms that could endanger the coastal 
defenses. The question put forward by Deltares is: can we get an idea how the 
really disastrous wind fields look like on the basis of the 'observed' wind fields? 
We want to show that this can be done using the generalized Pareto process. 

Consider a continuous stochastic processes {X(s)} se s where S is a compact 
subset of M. d . Suppose that the probability distribution of the process is in 
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the domain of attraction of some max-stable process i.e., there exist functions 
a n (s) > and b n (s) (s G S) such that the sequence of i.i.d. processes 



max 

Ki<r. 



Xj(s)-b n (a) 

a n {s) 



s£S 



converges to a continuous process, say 77, in distribution in C(S). Then r\ is a 
max-stable process. 
Define 



T t X{s) := (l+7(a 
Rr t x ■ 



X(s)-b t (s) 



<h(s) 



l/ 7 (s) 



sup T t X(s). 
ses 



As before, suppress the s from now on. Then, with t some large constant, 
T t toT t X — btt 



P 



P 



att a 



Ch 



£A\R Tt x > 1 



(4.2) 



*>(l + 7^) 



l/ 7 



1 



Otto 



bttg - b t 
Otto 



G A\R Tt x > 1 



a t tl 1 + 7 



X-b t 



_ f V btt 



a tt 



= pta&fx^H^ 



a tt 



a 1 



att 
btt -h tf Q - 1 



G A \R Tt x > 1 



o, 



7 



G A\R TtX > 1 



p^^ i + t /»' 



ni 



a t 



t 1 - 1 
l 1 



«, 



G 4% > 1 



Since, 



a tt (s)t 



-y(«) 



1 and 



6 tt0 ( s )-6 t ( s ) t2 (s) -l 



OtU) 



at(s) 



7(s) 







uniformly for s £ S, the limit of this probability, as t — >• 00, is the same as the 
limit of 

X — ^GA|i? TtX >ll (4.3) 

at J 

which is P(iy G A) by CorollaryO 

In this subsection we are not so much interested in estimating the joint limit 
distribution (which is the peaks-over-threshold method) but in the fact that the 
two conditional distributions (14. 2[) and (j4.3[) are approximately the same. 

Suppose for example that we have observed wind fields over a certain area 
during some time. Then we are likely to find some rather heavy storms i.e. ones 
that satisfy X ^ b n . These are the moderately heavy storms. However we want 
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to know how the storm held of a really heavy storm (i.e. X ■£. &at with N > n) 
looks like. That is exactly what relation (|4.2[) does. Take a moderately heavy 
storm X and transform it to T£~— T„X. This results in the storm field of a 

n n n 

really heavy storm by relation (|4.2I) . 

Notice then what we do here is similar to prediction or kriging, not estimating 
a distribution function. 

The reasoning above also holds with estimated functions of 7, a and &, on 
the basis of A:-th upper order statistics and taking t = n/k. 

Under the above framework, we propose the following simulation method: 

(1) Let Xi,X2, ■ ■ ■ , X n be i.i.d. and let the underlying distribution satisfy the 

conditions above, namely that the probability distribution is in the domain 
of attraction of some max-stable process. 

(2) Estimate the functions 7, a and b (de Haan and Lin (2003), Einmahl and 

Lin (2006)); denote the estimators by 7, a and b. Note that this procedure 
provides us with a number k that reflects the threshold for estimating the 
parameters. 



(3) Select from the normalized processes 

T n / k Xi := 1+7 



Xi — b n / k 



1/7 



n/k J 

those that satisfy Xi{s) > b n /k{s) for some s £ S i.e. for which Rr„ /k Xi > 
1. 

(4) Multiply these processes by a (large) factor to; this brings the processes to 

a higher level without changing the distribution essentially. 

(5) Finally undo the normalization i.e., in the end we obtain the processes 

T£~/ k tof n / k Xi for those X { for which RT n/k x t > 1. 

These processes are peaks-over-threshold processes with respect to a much 
higher threshold (namely b s (tto)) than the processes Xi for which RT n/k Xi > 
1 (with threshold b s (t)). 

Remark 4.1. Note that an alternative procedure under the maximum domain of 
attraction condition would be, first to estimate the spectral measure and then to 
simulate GP from there. But estimation of spectral measure is more difficult (de 
Haan and Lin (2003)) although this procedure is less restrictive on the number 
of observations than can be simulated. 
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o.o 0.2 0.4 o.e o.s 



O.O 0.2 0.4 0.6 O.S 



Figure 1: (a) The processes T n / k Xi for which Rr n/k Xi > 1 obtained from moving 
maximum process with standard Gaussian density; (b) the corresponding lifted 
processes T^j k toT n / k Xi with to = 10 

4.3 Simulations 

We exemplify the lifting procedure with the process X(s) = Z(s) J ( s \ with 
j(s) = 1 — s(l — s) 2 , s G [0, 1], and Z is the moving maximum process with 
standard Gaussian density. The Z process can be easily simulated in R-package 
due to Ribatet (2011). In Figures HHH are represented the 11 out of 20 of these 
processes, normalized and for which R,T n/k Xi > 1; an d the corresponding lifted 
processes T^~ /k t T n / k Xi with to = 10. 
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